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Four Massachusetts-type (Mass-type) strains of infectious bronchitis coronavirus (IBV) were compared 
genetically with the pathogenic M41 and HI 20 vaccine strains using the complete genomic sequences. 
The results revealed that strains ck/CH/LNM/091017 and ck/CH/LDL/101212 were closely related to the 
H120 vaccine, which suggests that they might represent re-isolations of vaccine strains or variants of vac¬ 
cine strains that have resulted from the accumulated point mutations after several passages in chickens. 
In contrast, strains ck/CH/LHLJ/07VII and ck/CH/LHLJ/100902 had a close genetic relationship with the 
pathogenic M41 strain. In addition, molecular markers have been identified that distinguish between 
field and vaccine (or vaccine-like) Mass-type viruses, which may be able to differentiate between field 
and vaccine strains for diagnostic purposes. Phylogenetic analysis, and pairwise comparison of full-length 
genomes and the nine genes, identified the occurrence of recombination events in the genome of strain 
CK/VH/LHLJ/07VII, which suggests that this virus originated from recombination events between 
M41- and HI 20-like strains at the switch site located at the 3' end of the nucleocapsid (N) genes. To 
our knowledge, this is the first time that evidence for the evolution and natural recombination under field 
conditions between Mass-type pathogenic and vaccinal IBV strains has been documented. These findings 
provide insights into the emergence and evolution of the Mass-type IB coronaviruses and may help to 
explain the emergence of Mass-type IBV in chicken flocks all over the world. 

© 2012 Elsevier B.V. All rights reserved. 


1. Introduction 

In 1931, Schalk and Hawn described “an apparently new respi¬ 
ratory disease of chicks” in North Dakota in the United States, 
which was considered to be infectious bronchitis (IB) by later 
researchers of avian respiratory diseases (Schalk and Hawn, 
1931). Currently, IB still occurs in nearly all poultry-producing 
countries; it is a highly contagious, acute, and economically impor¬ 
tant viral disease of chickens. The etiology of IB, which was first 
demonstrated by Beach and Schalm (1936), is infectious bronchitis 
virus (IBV). 

IBV is grouped in the genus Gammacoronavirus of the family 
Coronaviridcie in the order Nidovirales (de Groot et al„ 2011). The 
coronavirus genomes are the largest among the known RNA 
viruses and are polycistronic, generating a nested set of subgenom- 
ic RNAs with common 5' and 3' sequences (Masters, 2006). Like 
those of all other coronaviruses, the 5' two-thirds of the IBV 
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genome consists of two large replicase open reading frames (ORFs), 
ORFla and ORFlb. The ORFla polyprotein (ppla) can be extended 
with ORFlb-encoded sequences via a -1 ribosomal frameshift at a 
conserved slippery site (Brierley et al„ 1987), which generates the 
polyprotein pplab, comprising more than 7000 amino acids, which 
includes the putative RNA-dependent RNA polymerase (RdRp) and 
RNA helicase (HEL) activity (Ziebuhr et al„ 2001). The ppla and 
pplab of IBV are processed autocatalytically by two different viral 
proteases, encoded by a papain-like protease (PLP) and a 3C-loke 
protease (3CL pro ) (Lee et al„ 1991; Ziebuhr et aL, 2000, 2001). Other 
putative domains, presumably associated with a 3'-to-5' exonucle¬ 
ase (ExoN) activity, a poly(U)-specific endo-RNase (XendoU) activ¬ 
ity, and a 2'-0-methyltransferase (2'-0-MT) activity, have been 
predicted in pplab (Ivanov et al„ 2004; Snijder et al., 2003). The 
3' end of a coronavirus genome includes the viral structural and 
accessory protein genes: a spike (S) glycoprotein gene; an envelope 
(E) protein gene; a membrane (M) glycoprotein gene; a nucleocap¬ 
sid (N) phosphoprotein gene; and several ORFs that encode puta¬ 
tive non-structural accessory proteins (Masters, 2006). Of the 
virus-encoded proteins, the SI subunit of the S protein carries 
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virus-neutralizing activity, determines the serotype of 1BV and is 
responsible for viral attachment to cells. It is also a major determi¬ 
nant of cell tropism in culture (Casais et al„ 2003). The accumula¬ 
tion of point mutations, deletions, insertions and recombination 
events that have been observed in multiple structural genes, espe¬ 
cially the SI gene, of IBV recovered from naturally occurring infec¬ 
tions have been considered to contribute to the genetic diversity 
and evolution of IBV, and consequently, to a number of IBV sero¬ 
types (Cavanagh, 2007). 

The occurrence and emergence of multiple serotypes of the 
virus have complicated control by vaccination because many sero¬ 
types and variants do not confer complete cross-protection against 
each other (Cavanagh and Gelb, 2008). The originally discovered 
Massachusetts (Mass) type of IBV was identified in the United 
States, beginning in the 1950s (Fabricant, 2000; Johnson and 
Marquardt, 1975; Mondal et al., 2001). Mass-type strains have 
been isolated in Europe and Asia since the 1950s and up to the 
present day (Cavanagh and Gelb, 2008), together with dozens of 
other serotypes that have been isolated in Africa, Asia, India, 
Australia, Europe, and South America (Cavanagh, 2001, 2003, 
2005). The first Mass-type “H” vaccines were developed in about 
1960. They include H120 and H52 (Bijlenga et al„ 2004), and are 
used very commonly and widely around the world. However, virus 
of this type is occasionally isolated from Massachusetts-vaccinated 
and -unvaccinated flocks with respiratory clinical signs. Some of 
the viruses have shown close genetic relationships with pathogenic 
Mass-type, rather than vaccine, strains by SI gene analysis. 
However, conclusions based on the genetic analysis of a single 
gene sequence, and sometimes even a partial gene sequence, 
require caution because the true phylogeny can only be demon¬ 
strated by analyzing complete genomic sequences. Herein, we 
sequenced the complete genome of four IBV Mass-type strains that 
showed SI gene diversity (Liu et al., 2009; Ma et al„ 2012; Sun 
et al., 2011), and we present evidence for in-field recombination 
between pathogenic and vaccinal strains. Furthermore, we charac¬ 
terized the molecular variability of the four Mass-type strains to 
gain insight into the emergence and evolution of these viruses. 


2. Materials and methods 

2.1. Virus strains 

Four Mass-type IBV strains were used for complete genomic se¬ 
quence comparison and analysis in this study. Strain ck/CH/LHLJ/ 
07VII was isolated in 2007 from the kidney of a layer hen vacci¬ 
nated with HI20 and 4/91 in Heilongjiang province, China (Liu 
et al., 2009). Strain ck/CH/LNM/091017 was isolated in 2009 from 
the swollen proventricular tissues of a broiler vaccinated with 
HI20 in Neimenggu province, China (Sun et al., 2011). Strains ck/ 
CH/LDL/101212 and ck/CH/LHLJ/100902, both of which were iso¬ 
lated in 2010, were obtained from laying hens in Dalian and Hei¬ 
longjiang provinces, respectively, in China; the birds were 
suffering from nephropathogenic lesions and respiratory signs, 
respectively. In addition, the diseased birds in both flocks were suf¬ 
fering from proventriculitis (Ma et al., 2012). All of the IBV strains 
have been associated with various IB outbreaks in recent years in 
China and were assigned to the Mass-type strains by SI sequence 
analysis. To avoid the possible mutation in the viral genome after 
serial passages in specific-pathogen-free (SPF) embryonated chick¬ 
en eggs, the first passage of each original virus stock was used and 
purified once by propagating in 9- to 11-day-old SPF chicken eggs 
with a dose of 10 4 -fold dilutions per egg, and the presence of viral 
particles in the allantoic fluids of inoculated eggs was confirmed 
with a negative contrast electronic microscope (JEM-1200, EX) 
and reverse transcriptase-polymerase chain reaction (RT-PCR) as 


described previously (Han et al., 2011). In addition, since these 
viruses were isolated from chickens vaccinated with HI20, it is 
possible that mixed IBV infections are present in one chicken flock. 
To exclude this, nine clones of SI gene of each virus obtained from 
three independent PCR reactions were sequenced and compared. 
Sequences of each virus identical to the previously results were ob¬ 
tained (Liu et al., 2009; Ma et al., 2012; Sun et al., 2011). 

2.2. Eggs 

Fertile White Leghorn SPF chicken eggs were obtained from the 
Laboratory Animal Center, Harbin Veterinary Research Institute, 
the Chinese Academy of Agricultural Sciences, China. 

2.3. RNA extraction and RT-PCR 

To determine the full-length genomic sequences of the four 
viruses, 15 pairs of overlapping primers encompassing the entire 
genome were used. The primers were designed in regions that 
are conserved among most of the IBV strains available in the Gen- 
Bank database. The sequences and locations of the primers used in 
RT and PCR in this study are presented in Table 1. 

Viral RNA was extracted from 200 pi of infectious allantoic fluid 
using TRIzol reagents (Invitrogen, Grand Island, USA), following the 
manufacturer’s protocol. Complementary DNA (cDNA) was synthe¬ 
sized using 80 pi of the first strand mixture (Invitrogen) containing 
20 pM of primers N (-), 0.5 mM each of dNTP (TaKaRa, Dalian, Chi¬ 
na) and 40 pi of total RNA. The mixture was incubated at 70 °C for 
5 min and then quick-chilled on ice for 2 min. The RT master mix 
was composed of 16 pi 5x RT buffer (Invitrogen), 8 pi 10 mM 
DTT, 200 U of M-MLV Reverse Transcriptase (Invitrogen), and 
20 U RNAse inhibitors (Invitrogen). This RT master mix was incu¬ 
bated at 37 °C for 3 h. The reaction was terminated by heating at 
70 °C for 10 min then chilling on ice for 5 min. 

The PCR was performed in a 25 pi reaction containing 2 pi first 
strand cDNA; 15 nmol each of downstream and upstream primers; 
5 pi of lOx PCR buffer (Mg 2+ Plus, TaKaRa); 4 pi of 2.5 mmol 
dNTPs; 2 U Taq polymerase (TaKaRa); and 18 pi of water. The reac¬ 
tion was conducted at 95 °C for 5 min, and 30 cycles of 94 °C for 
1 min; 50 °C for 1 min; 72 °C for 2 min, and a final extension step 
of 72 °C for 10 min. A product, detectable by ethidium bromide 
staining, of the expected size was generated. 

2.4. The 5'- and 3'-ends of the genome 

A cDNA clone representing the 5' and 3' ends of the genome of 
the four IBV strains was synthesized according to the 5' RACE and 
3' RACE System for rapid amplification of cDNA ends (TaKaRa). PCR 
was performed according to the instructions accompanying the 
kits. The sense and antisense primers used to amplifying the 5'- 
and 3'-ends of the genome had been designed on the basis of the 
sequences obtained above that were constant in the four IBV 
strains, respectively. The outer and inner primers used to amplify 
the 5'-end of the four IBV strains were 5'-CAGCTATGGCAATGCG 
CAG-3' and 5'-CATCTTTGGTGTCTCA/TCC-3', respectively. The pri¬ 
mer used to amplify the 3'-end was 5'-GAGGAGAGGAACAATGC 
ACA-3'. 

2.5. DNA cloning and sequence determination 

The DNA generated by PCR amplification was cloned using a T- 
tailed vector, pMD18-T (TaKaRa), and transformed using JM109 
competent cells (TaKaRa) according to the manufacturer’s instruc¬ 
tions. At least five clones of each fragment in each strain were se¬ 
quenced and the consensus sequence was determined. The 
sequences were analyzed using the Sequencher 4.5 sequence 
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Table 1 

Oligonucleotide primers used for 1BV genome amplification and their positions in the genome. 


Primer 

Sense 3 

Sequence (5' to 3') 

Position in genome b 

IBV-366 

+ 

AACCCAAAAGATTACGCTGATGCTT 

640-664 

IBV-367 

- 

CCAGAATTCAAAAGACTTTTCAGTA 

1887-1911 

IBV-368 

+ 

GATGTCTTGAAGCTGTTTCAATC 

1828-1850 

IBV-369 

- 

AAGAATGTGCTTGTAGATGGCGT 

3817-3839 

IBV-370 

+ 

TATGTTAAGAAACATGGGCCAC 

3685-3706 

IBV-371 

- 

TACATTTATGTCTCCTGAAGTTGCTGG 

6405-6431 

IBV-372 

+ 

GTAAGAGACATAATTGGTATTG 

6353-6374 

IBV-373 

- 

GGTGCAGTTTTAATTTTACAAACTGC 

9052-9077 

IBV-377 

+ 

CTTAATCTTGCTAATAATCATGAG 

8971-8994 

IBV-378 

- 

GTGTACAACTGTTGTTTTAATGCAGC 

10,686-10,711 

IBV-379 

+ 

GTGTGGGAAGTCTTTTCGACAAATATAC 

10,588-10,615 

IBV-380 

- 

G G ACGTTGT C AATTT AAAG GTTC 

12,100-12,122 

IBV-381 

+ 

CTCCTGATCAGGATTCTTATGGAGGAGC 

12,011-12,038 

IBV-382 

- 

CCTGTGGCTGTTATGGAGCGTTATAT 

14,835-14,860 

IBV-383 

+ 

C ATTTT G G GTG CAT GT GTTTTT GTAG AT G 

14,789-14,817 

IBV-384 

- 

G ACACCAT GT G AC AAAC ACTAC AC AAT CT G 

17,407-17,436 

IBV-375 

+ 

CCTTGGCATGTTATAAGACCAAGGATAG 

17,346-17,373 

IBV-376 

- 

CGCTCCATCTTTTAATGGCATACCATTCTG 

18,903-18,932 

IBV-374 

+ 

GTTCTGTATGATGATAGATATGGTGATTACC 

18,780-18,810 

IBV-105-1 

- 

CCAAAGTGCCTTTAGACCGGCTG 

20,388-20,412 

IBV-257 

+ 

TATTGATTAGAGATGTGG 

20,356-20,373 

IBV-275 

- 

GTTACAGATGAGTACATAC 

22,147-22,165 

IBV-167 

+ 

GCTTCTTGAGAA(T/C)CAGTTTTA 

21,921-21,941 

IBV-168 

- 

AG ACG AT C AACTT GTG C ATCTG 

22,952-22,973 

IBV-182 

+ 

GACAnTAC(C/G)(A/C)GCAACTTGA 

22,921-22,940 

IBV-281 

- 

GCCACTGACC(C/A)TCACAATAAAG 

24,955-24,976 

IBV-280 

+ 

CCC(C/A)GAATCTAATGCCGTAGG 

24,846-24,866 

IBV-171 

- 

AACCAAGATGCATTTCCAGA 

25,960-25,979 

N(+) 

+ 

GACGCCCCAGCGCCAGTCATTAAA 

25,903-25,926 

N(— ) c 

- 

ACGCGGAGTACGATCGAGGGTACA 

27,484-27,507 


a Negative-sense (-) or positive-sense (+) primer. 

b Nucleotide positions corresponded to the sequence of the IBV Beaudette genome, GenBank accession number M95169. 
c Primer N(-) was used for RT. 


Table 2 

IBV and TCoV strains used for genome sequence comparison in the present study. 


Virus 

GenBank accession Nos. 

Virus 

GenBank accession Nos. 

A2 

EU526388 

CK/CH/LDL/97I 

AF345267 

SAIBK 

DQ288927 

Gray 

GU393334 

SC021202 

EU714029 

FL18288 

GU393333 

BJ 

AY319651 

Conn46 1991 

FJ904719 

CK/CH/LSD/05I 

EU637854 

DY07 

HM245923 

Partridge/GD/S14/2003 

AY646283 

GX-YL 5 

HQ848267 

Peafowl/GD/KQ6/2003 

AY641576 

CQ04 

HM245924 

ZJ971 

EU714028 

GX-YL 9 

HQ850618 

TW2575/98 

DQ646405 

JMK 

GU393338 

H52 

EU817497 

Cal 1995 

FJ904714 

H120 

FJ888351 

Georgia 1998 Vaccine 

GQ504723 

Mass 41 

AY851295 

Mass41 1965 

FJ904720 

California 99 

AY514485 

Delaware 072 

GU393332 

Beaudette 

NC_001451 

Arkansas DPI 

GQ504720 

ArkDPIll 

EU418976 

ck/CH/LZJ/l 11113 

JX195176 

Iowa 97 

GU393337 

Sczy3 

JF732903 

TCoV/MG10 

NC_010800 

Holte 

GU393336 

TCoV/TX/1038/98 

GQ427176 

ck/CH/LDL/091022 

JX195175 

TCoV/IN/517/94 

GQ427175 

YN 

JF893452 

TCoV/TX/GL/01 

GQ427174 

Cal56b 

GU393331 

TCoV/VA/74/03 

GQ427173 

ck/CH/LHB/100801 

JF330898 

TCoV/540 

EU022525 

ITA/90254/2005 

FN430414 

TCoV/ATCC 

EU022526 

NGA/A116E7/2006 

FN430415 


analysis program, and a single contiguous sequence comprising the 
entire IBV genome of each of the four IBV strains was constructed. 

2.6. Phylogenetic relationships and sequence comparison 

The nucleotide and amino acid sequences of the entire genome 
of the four IBV strains were assembled, aligned, and compared with 


those of other reference IBV and Turkey coronavirus (TCoV) strains 
using the MEGALIGN program in DNAStar (version 7, Lasergene 
Corp, Madison, WI). The ORFs were determined using the Gene 
Runner program version 3.00 (http://www.generunner.com) by 
comparison with those of other reference IBV and TCoV strains. A 
total of 39 IBV and 7 TCoV reference strains, for which entire geno¬ 
mic sequences were available in GenBank database, were selected 
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Cleaved by papain-like protease (PLpro) 
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Fig. 1 . Genome organization of four Mass-type 1BV isolates. Diagrammatic representation of the genome organization shows the predicted genes and their relative sizes and 
positions along the 1BV genome. S, spike glycoprotein gene; 3a, 3b, and 3c (E), tricistronic gene 3; M, membrane protein gene; 5a and 5b, bicistronic gene 5; N, nucleocapsid 
protein gene; UTR, untranslated region. Top: expanded representation of the two ORFs (ORFla and ORFlb) comprising the polycistronic Gene 1 and the likely cleavage 
products and cleavage sites after proteolytic processing of the 1 a/1 b polyprotein. Bottom: expanded representation of the 3'-end structural and accessory protein genes. 


for phylogenetic analysis of full-length genomes. The selected 
avian coronavirus reference strains and their accession numbers 
are provided in Table 2. Phylogenetic analysis, accurate estimation 
and comparison of the 5'-UTR, Gene 1, SI, S2, Gene 3, M, Gene 5, N 
and 3'-UTR of the four 1BV strains was conducted with those of the 
Mass-type strains selected in this study using the Clustal V method 
of DNAStar software and MEGA4 (Liu et al„ 2009), and the align¬ 
ments were edited manually and adjusted to remove mistakes. 
Deletion, insertion and gene recombination were determined 
according to the results of the phylogenetic analysis and pairwise 
comparisons. 

2.7. CenBank accession numbers 

The full genomic sequences of the four Mass-type IBV strains 
described in this report have been deposited in the GenBank data¬ 
base with accession numbers ck/CH/LNM/091017 JF330899, CI</ 
CH/LHLJ/07V11 JF274479, ck/CH/LDL/101212 JF828981 and CI</ 
CH/LHLJ/100902 JF828980. 

3. Results 

3.1. Complete genomic sequence and organization of the four Mass- 
type IBV strains 

Four Mass-type IBV strains were subjected to genome sequenc¬ 
ing and phylogenetic analysis in this study. The sequences of each 
the four strains were assembled into one contiguous sequence to 
represent the entire viral genomes. Sequences of 27630, 27678, 
27630 and 27473 nucleotides were obtained from strains ck/CH/ 
LNM/091017, ck/CH/LHLJ/07VII, ck/CH/LDL/101212 and ck/CH/ 
LHLJ/100902, respectively, excluding the polyadenylation tail at 
the 3' end. The genomes of the viruses were similar overall in their 
coding capacity and genomic organization to those of other IBVs. 
The genome of each of the viruses contained two large slightly 
overlapping ORFs in the 5' two-thirds of the genome and multiple 
additional ORFs in the 3' one-third of the genome. Both termini 
were flanked with untranslated regions (UTRs). Ten ORFs were 
identified within the genome. Gene 1 contained motifs common 
to all coronaviruses, including ribosomal frameshifting and slip¬ 
pery sequences, because ORFlb is translated in the -1 frame. 
The typical coronavirus structural genes encoding the S, E, M and 


N proteins were identified following Gene 1 (Fig. 1). The genome 
organization was determined to be as follows: 5'-UTR-Gene 1 
(ORFla, lb)-S-Gene 3 (ORFs 3a. 3b, E)-M-Gene 5 (ORFs 5a, 5b)-N- 
UTR-3'. 

3.2. Strains ck/CH/LNM/091017 and ck/CH/LDL/101212 are closely 
related to the H120 vaccine 

The analysis of the complete genome showed that strains ck/CH/ 
LNM/091017 and ck/CH/LDL/101212 possessed 99.9% and 99.8% 
nucleotide identity with HI20, respectively. However, they shared 
91.3% and 91.4% identity with M41, respectively. Phylogenetic anal¬ 
ysis using the full-length genome and the 5'-UTR, Gene 1, SI, S2, 
Gene 3, M, Gene 5, N and 3'-UTR showed that the IBV strains ck/ 
CH/LNM/091017 and ck/CH/LDL/101212 consistently formed the 
same clade with vaccine-related strains of Mass-type (Figs 2 and 
3). The analysis of the SI gene showed that strains ck/CH/LNM/ 
091017 and ck/CH/LDL/101212 had high nucleotide identities 
(99.8% and 99.8%, respectively) with HI20, while they had 97.7% 
and 97.7% identity with M41. Multiple alignments revealed that 
there were 2 and 1 nucleotide mutations within the SI gene be¬ 
tween strains ck/CH/LNM/091017 and ck/CH/LDL/101212 and 
HI20; however, there were 36 and 39 mutations between strains 
ck/CH/LNM/091017 and ck/CH/LDL/101212 and M41. All these re¬ 
sults suggest that strains ck/CH/LNM/091017 and ck/CH/LDL/ 
101212 are closely related to the H120 vaccine strain. 

3.3. Strain ck/CH/LHLJ/100902 showed a close genetic relationship 
with pathogenic M41 

The percent nucleotide similarity between strain ck/CH/LHLJ/ 
100902 and HI20 for the full-length genomes was 91.3%; however, 
the percent similarity was up to 99.7% between strain ck/CH/LHLJ/ 
100902 and M41. In addition, in all of the nine trees constructed 
for the 5'-UTR, Gene 1, SI, S2, Gene 3, M, ORF5, N and 3'-UTR, 
the strain ck/CH/LHLJ/100902 constantly fell into the same clusters 
as the pathogenic M41 strain, and both belonged to the Mass-type. 
Pairwise comparison of the SI protein gene revealed that strain ck/ 
CH/LHLJ/100902 had 3 and 41 nucleotide mutations with respect 
to M41 and HI20, respectively. Taken together, these results dem¬ 
onstrate that strain ck/CH/LHLJ/100902 exhibits a close genetic 
relationship to pathogenic M41. 
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Fig. 2. Phylogenetic tree constructed on the basis of the full-length genomes of the IBV isolates and Turkey coronavirus (TCoV) reference strains using the neighbor-joining 
method. IBV strains ck/CH/LNM/091017, ck/CH/LHLJ/07VII, ck/CH/LDL/101212 and ck/CH/LHLJ/100902 are highlighted in bold. Numbers on the branches represent the 
percentage of 1000 bootstrap samples supporting the branch. 
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Fig. 3. Neighbor-joining tree of non-overlapping portion of the 5'-UTR, Gene 1, SI, S2, Gene 3, M, Gene 5, N and 3'-UTR of nine Massachusetts strains. Numbers on the 
branches represent the percentage of 1000 bootstrap samples supporting the branch: only values >50% are shown. 
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Fig. 3. ( continued ) 


3.4. Markers were found that distinguish between pathogenic and 
vaccinal Mass-type strains 

Pathogenic and non-pathogenic Mass-type strains were clus¬ 
tered into different clades by phylogenetic analysis of full-length 
genomic sequences and the 5'-UTR, Gene 1, SI, S2, Gene 3, M, Gene 
5, N and 3'-UTR. In addition, insertions and deletions were also ob¬ 
served that distinguished between the genomes of pathogenic and 
non-pathogenic Mass-type strains, as illustrated in Fig. 4 and Sup¬ 
plementary material 1. In non-pathogenic strains, five deletions: of 
31 nucleotides, 5 nucleotides, 9 nucleotides, 10 nucleotides, and 9 
nucleotides, respectively, were observed to be located in the nsp3 


of Gene 1. They were found to occur between genomic positions 
2897-2929, 3219-3225, 3241-3251, 3257-3268, and 3398-3407, 
respectively, by comparing the sequences with the homologous re¬ 
gions of pathogenic strains. In contrast, a 3-nucleotide and a 9- 
nucleotide insertion were found in nsp6 and between the M gene 
and gene 5, respectively. Additionally, a cluster of insertions was 
found at the 3'-UTR region in the non-pathogenic strains. These 
changes might not only account, at least partly, for viral fitness 
when the pathogenic virus has become adapted to egg embryos 
(Cavanagh et al., 2005; Hewson et al„ 2012), but may act also as 
molecular markers, able to differentiate between vaccine and field 
strains, for diagnostic purposes. 
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Fig. 4. Deletions or insertions in the IBV genomes of Mass-type strains. The sites at which the deletions or insertions are located are indicated by respective arrows and 
numbered. 
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3.5. Strain CK/CH/LHLJ/07VII is a mosaic 

Comparative sequence analysis based on full-length genomic se¬ 
quences and the sequences of the 5'-UTR, Gene 1, SI, S2, Gene 3, M, 
Gene 5 and N showed that strain CK/CH/LHLJ/07VII clustered with 
pathogenic Mass-type strains. The exceptions were the trees con¬ 
structed using the 3'-UTR and SI gene, in which CK/CH/LHLJ/07VII 
was grouped with non-pathogenic strains; this suggests that a pos¬ 
sible recombination event may have occurred. Thus the N and 3'- 
UTR of CK/CH/LHLJ/07VII were carefully compared pairwise with 
those of strains ck/CH/LNM/091017, ck/CH/LDL/101212, ck/CH/ 
LNM/091017, H120 and M41. Parallel to the result of the phyloge¬ 
netic analysis, ck/CH/LHLJ/07VII showed high similarity with M41 
at the 5'-end of the N gene; however, it showed high similarity with 
vaccine strain H120 at the 3'-end of the N gene (Supplementary 
material 2). The data strongly suggest that CK/CH/LHLJ/07VII arose 
from a homologous RNA recombinant event that involved a tem¬ 
plate switch between Massachusetts pathogenic M41-like and 
non-pathogenic HI 20-like strains. We located the switch site at 
the 3'-end of the N gene (Supplementary material 2), which implies 
that the template switch occurred within the N gene. 

3.6. Strain CK/CH/LHLJ/07VII has experienced evolution over time 

The percent nucleotide similarity between strain CK/CH/LHLJ/ 
07VII and Hf 20, and CK/CH/LHLJ/07V11 and M41, for the full-length 
genomes was 92.5% and 99.4%, respectively. Alignment revealed 
that a 3-nucleotide insertion was located in nsp6 of the CK/CH/ 
LHLJ/07V1I strain between genomic positions 24009 and 24013 
(Supplementary material 3). In addition, the SI gene of strain CI</ 
CH/LHLJ/07V11 showed extensive mutations by pairwise compari¬ 
son (Supplementary material 4) though it was grouped with 
HI20 by SI gene phylogenic analysis. These and our previous re¬ 
sults (Liu et al., 2009) showed that, with the exception of the occur¬ 
rence of recombination events, CK/CH/LHLJ/07VII has experienced 
multiple mutations and deletions in the genome over time. 

4. Discussion 

Understanding the evolution of Mass-type 1BV is important be¬ 
cause not only is this virus circulating worldwide but information 
on virus genomics will aid our understanding of the evolution and 
emergence of 1BV with infectious potential in vaccinated chicken 
flocks. In this study, we focused on the full-length genomic se¬ 
quences of four IBV isolates which had been shown to be of the 
Mass-type by SI gene analysis (Liu et al., 2009; Ma et al., 2012; 
Sun et al., 2011). Based on the high degree of similarity in the full 
genomic sequence, it could be concluded that two IBV strains, ck/ 
CH/LNM/091017 and ck/CH/LDL/101212, were very similar to the 
vaccine strain H120. They might therefore represent re-isolations 
of vaccine strains, although they were isolated from vaccinated 
chickens with respiratory disease. Similarly, IBV strains that 
showed a very close relationship to the HI20 vaccine strain have 
been isolated from unvaccinated broiler flocks in Slovenia with 
respiratory problems (Krapez et al., 2010). Alternatively, these 
strains might be variants of vaccine strains that have resulted from 
accumulated point mutations after several passages in chickens. A 
few key mutations in the SI subunit of the spike protein might re¬ 
sult in a change to a new serotype, which is defined as a lack of 
cross-neutralization with specific sera against different IBV sero¬ 
types (Cavanagh et al., 1992). The point mutations found in the 
genome that distinguish between the two isolates and vaccine 
strain HI20 might be the result of adaptive evolution driven by 
the host immune response when the vaccine strain was transmit¬ 
ted among chickens. Adaptive evolution is the process by which 


genetic changes in the viral genome leading to a more fit virus pop¬ 
ulation become fixed over time, and it has been reported to occur 
in many coronaviruses (Hasoksuz et al., 2007; Lee and Jackwood, 
2001; Shi et al., 2006; Tang et al., 2009; Zhang et al., 2006). 

As shown in this study, and as also occurs in other countries 
(Dolz et al., 2008; Rimondi et al„ 2009; Roussan et al., 2009), the 
isolation of Mass-type IBV is expected, because attenuated vaccine 
strains are used extensively in chicken flocks in China. However, 
vaccination is not likely to be the only explanation for the circula¬ 
tion of Mass-type virus, because ck/CH/LHLJ/100902 and CK/CH/ 
LHLJ/07VII were most closely related to a Massachusetts patho¬ 
genic type strain, M41. The isolation of a Massachusetts pathogenic 
strain from H120-vaccinated chicken flocks may be due to vaccina¬ 
tion failure in these flocks (Han et al„ 2011). Alternatively, molec¬ 
ular studies have shown that only a few changes in the amino acid 
composition of the SI spike protein can result in immune failure, 
even when the majority of the virus genome remains unchanged 
(Cavanagh et al., 1992). Our findings showed that mutations had 
occurred in the genomes of both ck/CH/LHLJ/100902 and CK/CH/ 
LHLJ/07VII, especially in the SI genes of CK/CH/LHLJ/07VII though 
it was in the same group with H120 strain in the phylogenetic 
analysis, implicating that strain CK/CH/LHLJ/07VII has experienced 
evolution over time. It has been reported that amino acid changes 
may result from immunological pressure caused by the wide¬ 
spread use of vaccines (Cavanagh et al„ 1988, 2005). 

The occurrence of recombination events is another process that 
allows new strains to emerge, and this has been well documented 
in IBV (Hughes, 2011; Jia et al., 1995; Kottier et al., 1995; Kusters 
et al., 1990; Wang et al., 1993) and other coronaviruses (Fu and 
Baric, 1992,1994; Makino et al., 1986). It is believed that the con¬ 
ditions for recombination amongst IBV strains in the field are as 
follows: an extremely large number of chickens, most maintained 
at high density; the ease of spread of the virus; and serotype co¬ 
circulation, including proof of co-infection with more than one 
serotype in a given flock (Cavanagh, 2007). In China, intensive 
chicken farms are concentrated in many provinces, including 
Heilongjiang, where ck/CH/LHLJ/07VII was isolated (Liu et al., 
2009). Almost all the chickens in China receive Mass-type vaccines 
at a very young age and subsequently receive this vaccine a couple 
more times during the rest of their life span. Therefore the vaccine 
virus exists constantly in chickens; it may persist in various inter¬ 
nal organs for 163 days or longer (Cavanagh and Gelb, 2008). Gen¬ 
erally, vaccination using the HI20 vaccine provides full protection 
against pathogenic Mass-type pathogenic IBVs and prevents the 
same type of pathogenic strain from being replicated and spread¬ 
ing in the flocks. However, a single amino acid substitution at posi¬ 
tion 63 of the SI subunit of the spike has resulted in escape 
mutants of Mass 41 (Cavanagh et al., 1988). This may have 
occurred in the case of the ck/CH/LHLJ/07VII SI gene (Liu et al., 
2009), which might have made it possible for both pathogenic 
and vaccine strains to co-exist in a given flock, leading to the 
occurrence of recombination. Similarly, an escape mutant could 
be a result of adaptive evolution driven by the host immune 
response. Consequently, it is likely that genetic changes due to 
adaptive evolution and recombination both contributed to the ori¬ 
gin and evolution of strain ck/CH/LHLJ/07VH: it is possible that 
adaptive evolution created a mutant, followed by recombination 
between Mass 41- and H120-like strains to create a novel virus. 

The recombination events from which the CK/CH/LHLJ/07VII 
virus resulted can be explained by a scenario in which the recombi¬ 
nation may have involved two parental viral strains, with initiation 
of RNA replication in a M41-like template of either negative or po¬ 
sitive polarity (Liao and Lai, 1992). This would be followed by 
switching of the polymerase-nascent cRNA complex to an HI 20- 
like virus template. The switch may have occurred at the 3'-end 
of the N gene. In general, for a recombinant virus to emerge and 
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establish itself in the field, it must be viable and have selective 
advantages. It has been reported that uptake of canine coronavirus 
(CCV) sequences by type II feline coronavirus (FCoV) may have led 
to increased viral fitness when compared with type 1 FCoV 
(Herrewegh et al., 1998). Recombination can also result in the 
emergence of new strains with distinct characteristics, such as 
pathogenicity and tissue tropism (Worobey and Holmes, 1999). In 
addition, in the CoV genome, as with most RNA viruses, the 3'-UTRs 
usually harbor important structural elements that are involved in 
replication and/or translation (Chang et al., 1994; Raman et al., 
2003; Raman and Brian, 2005; Goebel et al., 2006; Ziist et al., 
2008). In IBV, the 3'-UTR-binds to the N protein, which is essential 
for the synthesis of negative-strain viral RNA. Perhaps the acquisi¬ 
tion of the 3' end of the N gene and the 3'-UTRfrom H120-like virus 
by an M41 -like virus (e.g. CK/CH/LHLJ/07VII) can alter the efficiency 
of viral replication. This alteration may in turn affect pathogenicity. 
However, it remains unknown whether this is the true origin of CI</ 
CH/LHLJ/07VII, and therefore this strain is of particular importance 
to the surveillance of IBV in China. It will be of equal importance to 
examine future outbreaks of IBV in chickens by full-length genomic 
sequence analysis in the context of novel recombination events 
among IBV strains. Furthermore, investigations using reverse ge¬ 
netic systems might provide further insight into these issues and 
increase our understanding of IBV pathogenesis. 
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